Basic Statistics

Raw Counts

Name Value
Rows 10,000
Columns 16
Discrete columns 14
Continuous columns 2
All missing columns 0
Missing observations 0
Complete Rows 10,000
Total observations 160,000
Memory allocation 10.7 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 11 columns ignored with more than 50 categories.
## id: 10000 categories
## dateAdded: 9455 categories
## dateUpdated: 9327 categories
## address: 9954 categories
## categories: 5670 categories
## city: 2810 categories
## keys: 10000 categories
## name: 1545 categories
## postalCode: 5310 categories
## sourceURLs: 10000 categories
## websites: 6216 categories

QQ Plot

Correlation Analysis

## 12 features with more than 20 categories ignored!
## id: 10000 categories
## dateAdded: 9455 categories
## dateUpdated: 9327 categories
## address: 9954 categories
## categories: 5670 categories
## city: 2810 categories
## keys: 10000 categories
## name: 1545 categories
## postalCode: 5310 categories
## province: 47 categories
## sourceURLs: 10000 categories
## websites: 6216 categories
## Warning in cor(x = structure(list(latitude = c(40.39629, 39.08135, 39.09148, : the standard deviation is zero
## Warning: Removed 12 rows containing missing values or values outside the scale range (`geom_text()`).

Principal Component Analysis

## 11 features with more than 50 categories ignored!
## id: 10000 categories
## dateAdded: 9455 categories
## dateUpdated: 9327 categories
## address: 9954 categories
## categories: 5670 categories
## city: 2810 categories
## keys: 10000 categories
## name: 1545 categories
## postalCode: 5310 categories
## sourceURLs: 10000 categories
## websites: 6216 categories
## Warning in plot_prcomp(data = structure(list(id = c("AWrSh_KgsVYjT2BJAzaH", : The following features are dropped due to zero variance:
##  * primaryCategories_Accommodation...Food.Services
##  * country_US